Fix memory leaks and performance improvements by John-194 · Pull Request #22 · wangyiqiu/dbscan-python

John-194 · 2025-06-17T17:33:40Z

Fixes

- Major leak: hasEdge duplicates some new treeT objects without removing them. Fix: preallocate them. Note: I had trouble using locks here.
  - Performance improvement: 1.16x
  - Memory improvement: from ~6.3 GB accumulated during my test to ~550 MB
- Major leak: missing Py_DECREF calls.
  - Performance: negligible
  - Memory improvement: from ~550 MB accumulated during my test to a few MB
- Minor leak: multiple threads cache the same data in rangeNeighbor at once, overwriting each other. Fix: use locks.
  - Performance: negligible
  - Memory improvement: from a few MB (which would cause problems over long-term use) to no leakage
- Performance optimization: the Scheduler gets initialized on every DBSCAN call. Fix: initialize it once.
  - Performance improvement: 1.48x on top of the first one
  - Memory improvement: none

For performance and memory testing, I used a point cloud of shape (12895, 2) with unique labels [-1 0 1 2 3 4 5 6], clustered repeatedly in a for loop of n=2000 iterations.
A final test with n=60000 showed no increase in memory as measured by Heaptrack. Total performance gain: 1.72x (1.16 × 1.48 ≈ 1.72).
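
The scheduler item above (initialize once instead of on every DBSCAN call) can be illustrated with a function-local static. This is only a sketch under assumptions, not the code in this PR: `Scheduler` here is a hypothetical stand-in for the library's scheduler, and `sharedScheduler` and `clusterOnce` are made-up names.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical stand-in for the library's scheduler. The counter exists
// only so the example can show how many times construction happens.
struct Scheduler {
  static std::atomic<int> constructed;
  Scheduler() { ++constructed; }  // imagine costly worker-thread startup here
};
std::atomic<int> Scheduler::constructed{0};

// Function-local static: initialized on the first call. Since C++11 this
// initialization is thread-safe and happens exactly once.
Scheduler& sharedScheduler() {
  static Scheduler instance;
  return instance;
}

// Hypothetical entry point standing in for one DBSCAN call.
int clusterOnce(int nPoints) {
  assert(nPoints >= 0);
  Scheduler& sched = sharedScheduler();  // reused across calls, not rebuilt
  (void)sched;                           // real code would schedule work here
  return nPoints;
}
```

Calling `clusterOnce` in a loop constructs the scheduler only on the first iteration, which is the kind of saving the 1.48x figure above refers to.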

wangyiqiu

Hi @John-194
Thank you for looking into these issues. I generally agree with your changes and have just a few questions. Could you take a look?

wangyiqiu · 2025-06-21T19:56:22Z

+
+  parallel_for(0, G->numCell(), [&](intT i) {
+    if (ccFlag[i]) {
+        trees[i] = new treeT(G->getCell(i)->getItem(), G->getCell(i)->size(), false);


Major leak: hasEdge duplicates some new treeT objects without removing them. Fix - preallocate them. Note - I had trouble using locks here.

Thank you. In hasEdge, the trees were allocated on demand to save memory in case they are not required. How are the trees getting duplicated without being removed? From concurrent calls to hasEdge?

I think what was happening was that in hasEdge

if (!trees[n1]) trees[n1] = new treeT(cells[n1].getItem(), cells[n1].size(), false);//todo allocation, parallel
if (!trees[n2]) trees[n2] = new treeT(cells[n2].getItem(), cells[n2].size(), false);//todo allocation, parallel

while one thread is doing new treeT, another thread sees that trees[x] is still empty, so it tries to do the same. Both threads assign their new treeT to the same index, and one of the two objects leaks.

This fix could be refined to keep the on-demand allocation as originally intended, but it works as is, and speed improves because threads no longer do duplicate work, so I moved on to fixing other areas.
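
The race described above, and one way to keep on-demand allocation without leaking, can be sketched as follows. This is a minimal illustration under assumptions, not the approach taken in this PR (which preallocates the trees): `Tree`, `TreeSlots`, and `getOrCreate` are hypothetical names, and the slots are `std::atomic` pointers so that the thread that loses the race frees its own copy.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for treeT; counts live instances so leaks are visible.
struct Tree {
  static std::atomic<int> live;
  explicit Tree(int cell) : cell(cell) { ++live; }
  ~Tree() { --live; }
  int cell;
};
std::atomic<int> Tree::live{0};

// Lazily filled slots, one per cell (assumed layout, for illustration only).
class TreeSlots {
 public:
  explicit TreeSlots(std::size_t n) : slots_(n) {
    for (auto& s : slots_) s.store(nullptr, std::memory_order_relaxed);
  }
  ~TreeSlots() {
    for (auto& s : slots_) delete s.load(std::memory_order_relaxed);
  }

  // Returns the tree for cell i, building it on first use.
  Tree* getOrCreate(std::size_t i) {
    assert(i < slots_.size());
    Tree* cur = slots_[i].load(std::memory_order_acquire);
    if (cur) return cur;  // fast path: already built

    Tree* fresh = new Tree(static_cast<int>(i));
    Tree* expected = nullptr;
    // Publish only if the slot is still empty; exactly one thread succeeds.
    if (slots_[i].compare_exchange_strong(expected, fresh,
                                          std::memory_order_acq_rel,
                                          std::memory_order_acquire)) {
      return fresh;
    }
    delete fresh;     // another thread won; discard our duplicate
    return expected;  // on failure, expected holds the winner's pointer
  }

 private:
  std::vector<std::atomic<Tree*>> slots_;
};
```

With this scheme two threads may still build the same tree at the same time, so it trades some duplicate work for the memory saving of on-demand allocation. Preallocating avoids that duplicate work.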

wangyiqiu · 2025-06-21T20:00:50Z

-      floatT hop = sqrt(dim + 3) * 1.0000001;
-      nbrCache[bait-cells] = tree->rangeNeighbor(bait, r * hop, fStop, fWrap, true, nbrCache[bait-cells]);
+      // wait for other threads to do their thing then try again
+      std::lock_guard<std::mutex> lock(cacheLocks[idx]);


Minor leak: Multiple threads cache the same data in rangeNeighbor at once, overwriting each other. Fix - the use of Locks.

Thank you. Did you notice any performance degradation as a result of using mutex?

I did not notice a change in performance. Threads should now be doing less unnecessary work, which should offset the cost of the mutex.
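
The per-slot locking discussed here can be sketched roughly as below. This is an illustration under assumptions rather than the code in this PR: `NeighborCache`, `Neighbors`, and the `compute` callback are hypothetical, standing in for the `nbrCache` / `cacheLocks[idx]` pair and the `rangeNeighbor` query seen in the diff.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <memory>
#include <mutex>
#include <vector>

// Hypothetical cache of per-cell neighbor lists with one mutex per slot.
class NeighborCache {
 public:
  using Neighbors = std::vector<int>;

  explicit NeighborCache(std::size_t n) : locks_(n), cache_(n) {}

  // Returns the cached list for slot idx, computing it at most once.
  // Threads asking for the same slot wait on that slot's mutex instead of
  // each computing (and overwriting) the same result.
  const Neighbors& get(std::size_t idx,
                       const std::function<Neighbors()>& compute) {
    assert(idx < cache_.size());
    std::lock_guard<std::mutex> lock(locks_[idx]);
    if (!cache_[idx]) {
      cache_[idx] = std::make_unique<Neighbors>(compute());
    }
    // Safe to return after unlocking: a filled slot is never replaced,
    // and the unique_ptr keeps the list at a stable address.
    return *cache_[idx];
  }

 private:
  std::vector<std::mutex> locks_;
  std::vector<std::unique_ptr<Neighbors>> cache_;
};
```

Because each slot has its own mutex, threads working on different cells do not contend with each other, which is consistent with the negligible performance impact reported above.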

wangyiqiu · 2025-06-21T20:02:16Z

@@ -4,6 +4,27 @@
 #include "dbscan/pbbs/parallel.h"




Thank you for fixing this!

No problem! I also have a fix for a crash on Arm CPUs, which I will upload later. There is also a very rare segmentation fault (~1 in a million chance) that I am looking into.

John-194 added 4 commits June 17, 2025 18:54

Fix major memory leak

bab4be8

Fix Py_DECREF memory leak

6f9da83

Fix minor memory leak

cf01f87

Optimize scheduler initialization

2f9b33d

wangyiqiu reviewed Jun 21, 2025


wangyiqiu merged commit 3bfd391 into wangyiqiu:master Jun 22, 2025
4 of 5 checks passed

wangyiqiu merged 4 commits into wangyiqiu:master from John-194:leak_fixes
